TITLE OF THE INVENTION: 

INTERLEAVING METHOD AND APPARATUS WITH PARALLEL 
ACCESS IN LINEAR AND INTERLEAVED ORDER 

CROSS-REFERENCE TO RELATED APPLICATIONS: 

[0001] This application claims priority of U.S. Provisional Patent 

Application Serial No. 60/448,901 entitled, "Interleaving Method and 

Apparatus with Parallel Access in Linear and Interleaved Order," filed 

February 24, 2003, the entire contents of which are incorporated herein by 

reference. 

BACKGROUND OF THE INVENTION: 
Field of the Invention: 

[0002] The present invention relates to an interleaving method and apparatus 
for providing parallel access in linear and interleaved order to a predetermined 
number of stored data samples, such as a turbo interleaver for a turbo decoder 
in mobile communication systems. 

Description of the Related Art: 

[0003] Code concatenation is a practical technique for obtaining a code with 
a very long block length and a large error-correction capability. This is 
accomplished by combining two elementary codes. These codes have two 
distinct levels of encoding and decoding. The advantage of this coding scheme 
is that sequential decoding of the different codes can be performed. Thus, the 
decoding complexity of the overall code depends on the complexity of the 
decoder associated to each separate code used. This leads to a reduction of the 
decoding complexity. 

[0004] The concept of turbo codes is an iterative decoding of two codes 
concatenated either in parallel or in serial using a Soft Input Soft Output 
(SISO) elementary decoder. Each elementary decoder therefore provides a 
decision and a likelihood ratio which quantifies the probability that the 
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decision is correct. This information is passed to the next decoding stage in 
order to improve the bit error rate at each iteration. A turbo decoder can be 
implemented as certain number of pipelined identical elementary decoders 
depending on the number of iteration. 

[0005] Turbo codes are able to achieve performances with a signal to noise 
ratio close to Shannon's theoretical limit, provided that the code is long 
enough and that a sufficiently large number of iterations is used in the iterative 
decoding process. Convolutional turbo codes are built using a parallel 
concatenation of two Recursive Systematic Convolutional (RSC) codes 
separated by a large random interleaver. The elementary decoder used in the 
iterative decoding process consists of two constituent SISO decoders, one for 
each RSC encoder, an interleaver and a deinterleaver. Further details are 
described for example in "Near Shannon limit error-correction coding: Turbo- 
Codes", C. Berrou et al, in Proc. IEEE ICC '93, Geneva, Switzerland, pp. 
1064-1070, May 1993, incorporated herein by reference. 

[0006] In general, the interleaver randomizes an address of an input 
information or codeword and improves a distance property of the code word. 
It has been decided to use a turbo code in data transmission channels of third 
generation mobile communication systems, e.g. in a data channel of UMTS 
(Universal Mobile Telecommunications System) proposed by ETSI (European 
Telecommunication Standards Institute). 

[0007] Fig. 1 shows a schematic block diagram of a conventional interleaver 
as for example disclosed in document WO 00/70771 for interleaving frame 
data. An address generator 150 generates a read address for changing the 
sequence of input data bits according to an input frame data size and an input 
clock CLK, and provides an interleaver memory 100 with the generated read 
address RA. The interleaver memory 100 sequentially stores input data DI in a 
write mode of operation, and outputs the stored data as output data DO 
according to the read address provided from the address generator 150 in a 
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read mode of operation. A counter 130 counts the input clock and provides the 
clock count value to the interleaver memory 100 as a write address WA. Thus, 
the interleaver sequentially stores input data in the interleaver memory 1 00 in 
the write mode of operation, and outputs the data stored in the interleaver 
memory 100 according to the read address provided from the address 
generator 150 in the read mode of operation. Alternatively, it is also possible 
to change the sequence of the input data bits before storing them in the 
interleaver memory in the write mode of operation, and sequentially read the 
stored data in the read mode of operation. 

[0008] If such an interleaver scheme is to be provided with a parallel access 
to the stored data in linear and interleaved order, multiport random access 
memory (RAMs) are used with K (K>1) reading ports. However, such 
multiport RAMs require large chip areas and are very expensive. Moreover, 
multiport RAMs with K reading ports may not be available from each ASIC 
(Application Specific Integrated Circuit) vendor, or the maximum number of 
available ports is at least limited at several vendors. 

SUMMARY OF THE INVENTION: 

[0009] It is therefore one objective of the present invention to provide an 
interleaving method and apparatus with a parallel access to the data in linear 
and interleaved order, which can be implement at reduced cost and chip area. 

[0010] According to one embodiment, the invention provides an interleaving 
method for providing parallel access in linear and interleaved order to a 
predetermined number of stored data samples. The method comprises the 
steps of: storing the data samples in a memory array comprising a plurality of 
memory devices; using a first portion of an address of the memory array to 
address the memory devices; using a second portion of the address to select at 
least one memory device to be accessed; and changing the position of the first 
and second portions within the address, when the access order is changed 
between the linear order and the interleaved order. 
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[0011] According to another embodiment, the invention provides an 
interleaving apparatus for providing parallel access in linear and interleaved 
order to a predetermined number of stored data samples. The apparatus 
comprises: a memory array with a plurality of memory devices for storing the 
data samples; addressing means for addressing the memory devices by 
applying a first portion of an address to the memory devices and by using a 
second portion of the address to select at least one memory device to be 
accessed; and change means for changing the positions of the first and second 
portions within the address in response to a change between the linear order 
and the interleaved order. 

[0012] Accordingly, by splitting the data memory into several smaller 
memories and changing the address portions in the interleaved order and linear 
order, each of the smaller memories can be accessed in linear and interleaved 
order without requiring multiport memory devices with several reading ports. 
During the linear access order, data symbols or samples of each data block can 
be accessed in a sequential order from each of the memory devices one after 
the other, while in the interleaved access order, data samples can be randomly 
accessed from the memory devices. Due to the changeable addressing scheme 
and split memory arrangement, each of the memory devices may need only 
one reading port. 

[0013] The parallel access to the plurality of memory devices can be 
performed in a multiplex manner using the second address portion as a 
multiplexing index. Then, all memory devices can be accessed in a 
multiplexed manner within one clock cycle, such that the number of clock 
cycles for parallel reading can be reduced according to the degree of 
multiplexing, i.e. the number of multiplexed accesses within one clock cycle. 

[0014] The second address portion may correspond to a predetermined 
number of most significant bits of the address during a linear order access and 
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may correspond to a predetermined number of least significant bits of the 
address during an interleaved order access. In this case, the first address 
portion may correspond to the remaining bits of the address. Thereby, in the 
linear access order all data to be accessed from one memory device is 
sequentially read before the data of the next memory device is read, while in 
the interleaved access order each memory device is accessed only once before 
the next memory device is accessed. 

[0015] The first address portion may be subjected to an interleaving process 
during an interleaved access order. Thereby, each memory device is accessed 
in accordance with the interleaving scheme, while a selection of the memory 
devices is performed based on the second address portion. As an example, the 
first address portion may comprise ten address bits and the second address 
portion may comprise two address bits. Thereby, a maximum number of 1024 
memory locations of each memory device can be addressed, while four 
memory devices can be selected. 

[0016] The first address portion may be generated by an address counting 
function. 

[0017] Furthermore, the memory devices may be single-port RAM devices. 
The apparatus may be integrated in a single chip device. 

[0018] An interleaving means may be provided for interleaving an output 
address of an address counter to generate the first address portion during an 
interleaved access order. In particular, the interleaving means may comprise 
an address translation ROM (Read-Only Memory). 

[0019] The change means may comprise a controlled switch for receiving 
the address and for switching the first and second address portions to 
respective output ports in response to an access order selection signal. 

[0020] Further advantageous modifications are described in the dependent 
claims. 
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BRIEF DESCRIPTION OF THE DRAWINGS: 

[0021] In the following the present invention will be described in greater 
detail based on a preferred embodiment with reference to the accompanying 
drawings in which: 

[0022] Fig. 1 shows a schematic block diagram of a conventional 
interleaver; 

[0023] Fig. 2 shows a RAM access scheme in a linear addressing mode 
according to an embodiment of the invention; 

[0024] Fig. 3 shows a RAM access scheme for an interleaved addressing 
mode according to an embodiment of the invention; 

[0025] Fig. 4 shows a table indicating an access scheme of the interleaving 
apparatus according to an embodiment of the invention; and 

[0026] Fig. 5 shows a schematic block diagram of the interleaving apparatus 
according to an embodiment of the invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS: 
[0027] The preferred embodiment will now be described on the basis of an 
interleaver with parallel access to data in an interleaved and linear order, 
which may be applied in a turbo decoder. In particular, the interleaver 
comprises four RAM devices Ml to M4 and sixteen register memories to store 
sixteen data values or symbols Bl to B4, CI to C4, Dl to D4 and El to E4 
read from the four RAM devices Ml to M4 in four clock cycles. These sixteen 
data symbols B 1 to E4 can then be used in the next four clock cycles in four 
constitutional decoders of the turbo decoder. 

[0028] Fig. 2 shows a corresponding RAM access scheme in the linear 
addressing mode where the four consecutive blocks of four data symbols B 1 to 
E4 are accessed in four clock cycles from each of the RAM devices Ml to M4 
based on an address output from a 10-bit address counter 40. 
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[0029] In the present example according to the preferred embodiment, an 
array of N data symbols is divided in K blocks, where K is a power of 2. The 
interleaving scheme will be explained based on the present example, where K 
= 4 and N = 3800, which means that 3800 data symbols are divided to four 
blocks and stored in the respective RAM devices Ml to M4. Hence, each of 
the RAM devices Ml to M4 contains 3800/4 = 950 data symbols. The whole 
memory array consisting of the four RAM devices Ml to M4 is accessed by 
using 12 address bits of which 10 bits are used to access the RAM devices Ml 
to M4. 

[0030] In the linear access order shown in Fig. 2, the first RAM device Ml 
is first selected based on the most significant address bits Al 1 and A10 of the 
12-bit array address, while the remaining address bits A9 to AO are used to 
sequentially select the data symbols Bl to B4 during the first clock cycle. In 
the second clock cycle, the second RAM device M2 is selected by the most 
significant address bits All and A10 to sequentially read the following four 
data symbols CI to C4. Then, the third RAM device M3 is selected and 
addressed during the third clock cycle to sequentially read the four data 
symbols Dl to D4, and the fourth RAM device M4 is selected and addressed 
during the fourth clock cycle to sequentially read the data symbols El to E4. 
Accordingly, in this example, the address counter 40 is controlled to cyclically 
generate a sequence of four 10-bit addresses for addressing the respective 
memory locations in which the data symbols are stored. 

[0031] In the linear access order of the above case of N=3800 data symbols, 
the linear addressing can be obtained by providing the 10-bit address counter 
40 with a counting function for counting up from 0 to a maximum address 
value of 949 depending on the number of data symbols in each data block. 
Furthermore, a 2-bit RAM multiplex index generates the two most significant 
address bits All and A 10 for a parallel access to respective output registers. 
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[0032] Due to the fact that not all possible 4096 addresses are used, but only 
a number of 3800, the decimal address values of the second to fourth RAM 
devices M2 to M4 can be calculated according to following equation: 

A M = A 0 + (1024-950) x (Aq/950) 

wherein A M indicates the decimal value corresponding to the 12-bit address of 
the whole array including the 2-bit RAM multiplex index and A 0 indicates the 
direct index of the data symbols without considering the address gaps caused 
by the partial usage of the memory capacity. Hence, the symbol index A 0 
ranges from 0 to 949 in the first RAM device Ml, from 950 to 1899 in the 
second RAM device M2, from 1900 to 2849 in the third RAM device M3, and 
from 2850 to 3799 in the fourth RAM device M4, while the modified address 
or array address A M ranges from 0 to 949 for the first RAM device Ml, from 
1024 to 1973 for the second RAM device M2, from 2048 to 2997 for the third 
RAM device M3, and from 3072 to 4021 for the fourth RAM device M4. It is 
noted that the modified address A M corresponds to the array address where the 
two most significant bits All and A10 are set according to the multiplex index 
used for selecting the respective one of the RAM devices Ml to M4. 

[0033] Fig. 3 shows a RAM access scheme for four consecutive symbols in 
four clock cycles in an interleaved addressing mode achieved by supplying the 
memory address generated by the 10-bit address counter 30 to the RAM 
devices Ml to M4 via an interleaver addressing ROM 10 which converts the 
linear or sequential address into a random address according to a 
predetermined interleaving scheme. As can be gathered from Fig. 3, the first 
four data symbols B 1 to B4 are now no longer obtained solely from the first 
RAM device Ml, but from each of the four RAM devices Ml to M4, wherein 
the first data symbol Bl is obtained from the first RAM device Ml, the second 
data symbol B2 is obtained from the third RAM device M3, the third data 
symbol B3 is obtained from the second RAM device M2, and the fourth data 
symbol B4 is obtained from the fourth RAM device M4. The same applies to 
the remaining blocks of consecutive data symbols CI to C4, Dl to D4 and El 
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to E4. It is noted that this non-linear addressing scheme is generated by 
applying a corresponding sequence of array addresses which determines the 
selection of the RAM devices Ml to M4 and their respective memory 
locations. 

[0034] According to the preferred embodiment, the interleaving scheme can 
be achieved by supplying ten address bits of the 12-bit array address to the 
interleaver addressing ROM 10 and using the remaining two address bits for 
generating the multiplex index for the RAM selection function. In the 
preferred embodiment, the address for the interleaving access order can be 
derived from the 12-bit array address as follows: 

INTERL(A0, Al), INTERL (Al 1, A 10, A9, A8, A7, A6, A5, A4, A3, A2) 

[0035] Accordingly, the two least significant bits AO and Al of the 12-bit 
array address are now used as multiplex index for RAM selection, which may 
be obtained from an optional 2-bit interleaver, and the ten most significant bits 
are supplied to the interleaver addressing ROM 10 to generate the RAM 
address for addressing the RAM devices Ml to M4. The described interleaver 
function INTERL(x) which determines the content of the interleaver 
addressing ROM 1 0 can be based on any standard interleaver function, such as 
the interleaver function used in the UMTS Turbo-Decoder according to the 
ETSI specification. 

[0036] Due to the fact that the two least significant bits AO and Al are now 
used for RAM selection, consecutive data symbols are read from different 
ones of the RAM devices Ml to M4 as indicated in Fig. 3. 

[0037] In the cases of Fig. 2 and 3, where four consecutive symbols are read 
from each block in linear and interleaved order, respectively, which leads to 
the following reading order of symbol indexes in the linear access mode, 
assuming that the first memory addresses are used in each of the RAM devices 
Ml to M4: 
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Ml: symbol index: 0,1,2,3 which corresponds to the modified or 

array addresses 0,1,2,3; 
M2: symbol index: 950, 951, 952, 953 which corresponds to the 

modified addresses 1024, 1025, 1026, 1027; 
M3: symbol index: 1900, 1901, 1902,1903 which corresponds to the 

modified addresses 2048, 2049, 2050, 2051; and 
M4: symbol index: 2850, 2851, 2852, 2853 which corresponds to the 

s modified addresses 3072, 3073, 3074, 3075. 

[0038] As can be gathered from the above generation scheme of the 
interleaving address, the interleaving address only depends on the 10 most 
significant bits All to A2 of the modified address A M . Therefore, in present 
case where only the first four consecutive symbols of each RAM device are 
read, the ten most significant address bits will not be influenced during 
address counting of each RAM device Ml to M4. In particular, the ten most 
significant bits of the modified address A M all remain "0" for the first RAM 
device Ml, correspond to the decimal value "512" for the second RAM device 
M2, correspond to the decimal value "256" for the third RAM device M3, and 
correspond to the decimal value "768" for the fourth RAM device M4. The 
interleaver function which may be extracted from the UMTS Turbo Interleaver 
generates the following four addresses at the output of the interleaver 
addressing ROM 10: 

INTERL(O) = 933 

INTERL(256) =313 
INTERL(512) =764 
INTERL(768) = 65 

[0039] Fig. 4 shows a table indicating the access scheme of the interleaver 
according to the preferred embodiment for the above example of an access of 
four consecutive symbols in four clock cycles. 
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[0040] Starting from the left side of the table, the first column indicates a 
block index of the read data block, the second column indicates the clock cy- 
cle, the third column indicates the symbol index in the linear access order, the 
fourth column indicates the symbol index in the interleaved access order, the 
fifth column indicates the modified address or array address in the linear ac- 
cess order, the sixth column indicates the modified or array address in the in- 
terleaved access order, the seventh column indicates the number of the ac- 
cessed RAM device in the linear access order, the eighth column indicates the 
multiplex index or address of the accessed RAM device in the linear access 
order, the ninth column indicates the number of the accessed RAM device in 
the interleaved access order, and the tenth column indicates the 10-bit RAM 
address applied to the RAM devices in the interleaved access order which cor- 
responds to the output of the interleaver addressing ROM 10. 

[0041] As can be gathered from the above access scheme of Fig. 4, a change 
from the interleaved to the linear access order and vice versa is simply 
achieved by changing address portions used for RAM addressing and RAM 
selection, and supplying the RAM address to an interleaver functionality dur- 
ing the interleaved access order. 

[0042] Fig. 5 shows a schematic block diagram of an implementation exam- 
ple of the interleaver with parallel access in linear and interleaved order ac- 
cording to the preferred embodiment. 

[0043] An address counter 40 operates according to a linear counting 
scheme to generate a 10-bit address comprising address bits A9 to AO, and a 
2-bit index counter 30 outputs a cyclic 2-bit multiplex index comprising two 
bits Al 1 and A 10. The twelve output signals of the address counter 40 and the 
index counter 30 are supplied to a controllable switching unit 20 having a de- 
multiplexing functionality for either supplying the most significant address 
bits All to A2 at a first port and the least significant address bits Al and AO at 
a second output port, or supplying the most significant address bits All and 
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A 1 0 at a third output port and the least significant address bits A9 to AO at a 
fourth output port, based on a control signal L/I used for selecting a linear or 
interleaved access order. The output ports may be arranged in a tri-state tech- 
nology, such that the two non-used output ports are set to a high impedance 
level. 

[0044] If the linear access order is selected by the control signal L/I, the ten 
least significant address bits A9 to AO which correspond to the output of the 
address counter 40 are applied in parallel to respective address ports AD1 to 
AD4 of the RAM devices Ml to M4, and the most significant address bits Al 1 
and A 10 which correspond to the output of the index counter 30 are used as 
multiplex signals and are supplied to respective multiplexing inputs of corre- 
sponding register arrays 5 1 to 54 provided at the outputs of the RAM devices 
Ml to M4. The read data symbols are output from the RAM devices Ml to M4 
to the register arrays 5 1 to 54 in which the consecutive data symbols of the 
data blocks, e.g. Bl to B4, CI to C4, Dl to D4 and El to E4, are temporarily 
stored for further processing. 

[0045] If the interleaved access order is selected by the control signal L/I, 
the most significant address bits All to A2 are supplied via the interleaver 
addressing ROM 10 which maps the above interleaving function 
INTERL(A11-A2) to the address ports AD1 to AD4 of the RAM devices Ml 
to M4. Furthermore, the least significant bits Al and AO are supplied to the 
multiplexing inputs of the register arrays 51 to 54. Thereby, an interleaved 
access scheme as indicated in Fig. 3 can be obtained. 

[0046] It is noted that other suitable controllable switching functions may be 
applied for changing the allocation of the address bits to the multiplexing 
function and the RAM addressing function. Furthermore, the interleaver ad- 
dressing ROM 1 0 may be any kind of look-up table or logic suitable to im- 
plement the required interleaving function, and a corresponding additional 
interleaving function may be provided for the least significant bits AO and Al 
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in the interleaved access order. Moreover, any suitable allocation of address 
portions or address bits can be used to change between the linear and the inter- 
leaved access order. The single-port RAM devices Ml to M4 may be replaced 
by two dual-port RAMs. In case of a higher number of parallel output values, 
even two or more multiport RAMs may be used, while still reducing overall 
complexity and size of the interleaver. The preferred embodiment may thus 
vary within the scope of the claims. 
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